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We address the problem of distinguishing among a finite collection of quantum states, when the 
states are not entirely known. For completely specified states, necessary and sufficient conditions 
on a quantum measurement minimizing the probability of a detection error have been derived. In 
this work, we assume that each of the states in our collection is a mixture of a known state and an 
unknown state. We investigate two criteria for optimality. The first is minimization of the worst- 
case probability of a detection error. For the second we assume a probability distribution on the 
unknown states, and minimize of the expected probability of a detection error. 

We find that under both criteria, the optimal detectors are equivalent to the optimal detectors of 
an "effective ensemble" . In the worst-case, the effective ensemble is comprised of the known states 
with altered prior probabilities, and in the average case it is made up of altered states with the 
original prior probabilities. 
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I. INTRODUCTION 

Quantum detection refers to the retrieval of classical 
information encoded in a quantum-mechanical medium. 
Representing the information as one of m possible mes- 
sages, it is assumed that this medium has been prepared 
in a quantum state drawn from a collection of m known 
states, each associated with one of the messages. The 
medium is then subjected to a quantum measurement, 
in order to determine the prepared state. If the quantum 
states are not mutually orthogonal, then no measurement 
will distinguish perfectly between them. One, then, seeks 
a measurement scheme (detector), which optimally dis- 
criminates between the states in some sense. A popular 
criterion of optimality is minimization of the probability 
of a detection error. 

Possible applications for distinguishing between quan- 
tum states are digital communication via a quantum 
channel, or the output module of a quantum computer 
0. In theoretical quantum computation, the possible 
outcomes of a calculation are normally mutually orthog- 
onal, making the discrimination between the results triv- 
ial. In this paper, however, we address questions of im- 
perfections in the setup, making the results relevant to 
the implementation of working quantum computers. 

We consider an ensemble of quantum states, consisting 
of TO density operators {pi}^i on an rt-dimensional com- 
plex Hilbert space Ti,, with prior probabilities {pi}^i- A 
density operator p is a positive semidefinite (PSD) Her- 
mitian operator with Tr(p) = 1; we write p > to in- 
dicate that p is PSD. For our measurement we consider 
general positive operator- valued measures |2, |^ , consist- 



ing of TO PSD Hermitian operators {11^}™]^ that form a 
resolution on the identity on Ti.. 

For a completely specified state set {pi}™ i, necessary 
and sufficient conditions for an optimal measurement, 
which minimizes the probability of a detection error, have 
been derived 0, |E IB] • Explicit solutions to the problem 
are known in some particular cases 13, la, S HOi Hill j in- 
cluding ensembles obeying a large class of symmetries 

For arbitrary state sets, the problem of finding the op- 
timal measurement can be cast as a semidefinite pro- 
gramme (SDP) [g, which is a tractable convex optimiza- 
tion problem By exploi ting the many well-known 
algorithms for solving SDPs [ijj i the optimal mea- 
surement can be computed very efficiently in polynomial 
time within any desired accuracy. 

As with most physical systems, typically, one does not 
have full knowledge of the parameters. When applying 
the measurement, the states pi are often unknown to a 
certain extent, whether due to degradation (noise, deco- 
herence [T^ ) in the quantum medium, or to imperfect 
preparation. In this paper we investigate the effects of 
uncertainty in the states pi on the optimal measurement. 

To model the uncertainty we assume that each state 
Pi is a mixture of a known state and an unknown state 



P^ = qiP° + (1 - qi)pl 



(1) 



1neIron@tx. technion . ac . ill 
^ |yomna@ee. technion . ac . il| 



where the states p^ and < < 1 are known, and 
the operators pj are completely unspecified, except for 
being valid quantum states. The parameters qi serve as 
a bound on the amount of mixing of each state. 

A different detection strategy, known as unambiguous 
detection |l7lll8llT9l |. is to design a measurement of order 
TO-I- 1, where the extra answer stands for an inconclusive 
result. If the measurement returns an answer, then it 
is correct with probability 1. The goal is to design the 
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measurement, so that the probabihty of an inconclusive 
result is small. When the states that are to be detected 
are uncertain as in ensuring perfect detection of a 
state is impossible. For this reason we choose not to 
pursue this strategy. 

Detection of uncertain states has so far been addressed 
in the special case, where the quantum medium is the 
free space channel and the known states pf are coher- 
ent states, i.e. pure states ja^), each characterized by a 
complex number '23|. Vilnrotter and Lau ^ij model 
thermal noise as a probability distribution over a finite 
subset {ctj} of the complex plane, and then find the opti- 
mal measurement for states mixed according to this dis- 
tribution, i.e. = qj\ai-\-aj){ai + aj\. Concha and 
Poor use Ohya's model of quantum channels to 
model thermal noise in a multiaccess quantum free space 
channel. Both works rely heavily on the simple param- 
eterization of coherent states. This type of parametric 
averaging may not suit all quantum systems. In addi- 
tion, the method we propose in this work assumes far 
less knowledge (only the probability go associated with 
a, = 0). 

The measurement minimizing the probability of a de- 
tection error depends, in general, on the states pi. There- 
fore, if the states are not known exactly, then the optimal 
measurement cannot be determined. Here, we develop 
two approaches to detection in the presence of state un- 
certainty. The first strategy is motivated by the recent 
theory of robust optimization [23 . 125I |2^ , in which the 
worst-case solution is optimized. Adapting this method 
to our particular context, we consider maximizing the 
smallest possible probability of correct detection, over 
all quantum states of the form Robust semidefinite 
programming has already been introduced to the domain 
of quantum information in the context of entanglement 
witnesses 27\. Our second strategy is to define a prob- 
ability distribution over the region of uncertainty, and 
then maximize the average probability of correct detec- 
tion. This strategy is conceptually similar to those used 
in |2ll I22I , but does not rely on a particular quantum 
system. 

In Section^we present the problem in detail and state 
some known results. Section IIIII is an analysis of the 
worst-case approach to minimal detection error, and in 
Section 1^ we address the question of optimal detection 
on average. In both cases we find that the optimal mea- 
surement for the uncertain ensemble is equivalent to an 
optimal measurement for an "effective ensemble". The 
effective ensemble for worst-case detection is comprised 
of the known states with altered prior probabilities, 
whereby a bias towards the states which are more cer- 
tain is introduced. In the average case, it is made up of 
the states qip^ + ^—^I, with the original prior probabil- 
ities. We show explicitly that averaging over the region 
of uncertainty is equivalent to choosing pj = i/, where 
/ is the identity on Ti.. The quantum state ^7, known as 
the maximally mixed state, regularly serves in quantum 
mechanics to represent a complete lack of knowledge. 



For both strategies we address the special case where 
the uncertainty bounds are uniform in i {qi = q) . We find 
that for high values of q the worst-case optimal measure- 
ment coincides with the optimal nominal measurement. 
Also, in the equiprobable case, the optimal average mea- 
surement is identical with the optimal nominal measure- 
ment for any q > 0. 

Section contains the results and analysis of numer- 
ical simulations of several examples. We compare the 
characteristics and performance of the two approaches. 



II. FORMULATION OF THE DETECTION 
SETUP 

Assmne that a quantum channel is prepared in a quan- 
tum state drawn from a finite collection of quantum 
states. The quantum states are represented by a set of 
m PSD Hermitian density operators {pi}^i on an n- 
dimensional complex Hilbert space Ti.. The states pi, 
however, are not entirely known at the receiver, whose 
state of knowledge is defined by The receiver per- 
forms a quantum measurement 11, comprising m PSD 
Hermitian operators H = {H^}™ on Ti., in order to de- 
termine which of the messages was sent. 

We assume without loss of generality that the eigenvec- 
tors of the known density operators {Pi}iLi span [s^ H 
(in this case, the eigenvectors of qip'^ + {l — qi)pl also span 
Ti.). Under this assumption, the measurement operators 
Hi must satisfy 



H,; > 0, 



H,; = / 



(2) 



where / is the identity on Ti, in order to be a valid mea- 
surement. We shall denote the set of all m order POVMs 
n = {HJ™ 1 as 



n. >o, ^n, = / 



(3) 



Given that the transmitted state is pj , the probability 
of correctly detecting the state using measurement H — 
{Hi}™ is Tr(HjPj). Therefore, the probability of correct 
detection is given by 



PiTr(njPi) 



(4) 



with 



where pi > is the a-priori probability of pi 

When the states are known, we may seek the measure- 
ment H e _B that minimizes the probability of detection 
error, or equivalently, maximizes the probability of cor- 
rect detection. This can be expressed in the form of the 
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optimization problem 



III. OPTIMAL WORST-CASE DETECTION 



max < ^pjTr(n,;pi) 



(5) 



s.t. 



n, > 



(a) 



(b) 



li Qi — 1 for all Lso that pi is completely specified, then 



it was shown in 



that a measurement {n^}™ i G B 



solves (O if and only if there exists an operator U such 
that for all 1 < i < m 



U > PiPi 

{u - p,p,)n» = 



(6) 



(by the notation X > F we mean that X — Y is PSD). 
Throughout this paper we shall denote the measurement 
which solves problem ||SJ) by n'^Q™ — Il{pi,pi). 

In general, the optimal measurement will depend on 
the states pi. Because in our formulation they are un- 
known, new criteria for optimality must be defined. Be- 
fore doing so, we present a measurement, which is inde- 
pendent of the states pi, and will thus provide a lower 
bound on the optimal probability of correct detection, 
regardless of the criterion used. The following measure- 
ment is dependent solely on the prior probability distri- 
bution: for all i choose 



0, 



Pi Pmax 
Pi Pmax 



(7) 



where m„ 



< m stands for the number of states with 



prior probability Pmax- Using this measurement, the 
probability of correct detection is 



/ft "imax 

P, = ^p,Tr(n,p,) ^ J2 ^Tr(p,) = p„ 



(8) 



i=l 



This detector is effectively an unbiased guess from the 
subset of messages with maximal prior probability. 

Thus, probability of correct detection equal to Pmax 
can always be achieved. We seek a measurement that 
under conditions of uncertainty can perform better. In 
the following two sections, we derive necessary and suf- 
ficient conditions for optimal measurements, using two 
different criteria. These criteria refer to the probability 
of correct detection, while specifying a certain point in 
the region of uncertainty. In Section IlIII we present the 
optimal worst-case measurement, and in Section IIVI we 
propose an optimal expected measurement under the as- 
sumption of a probability distribution for the unknown 
states pj. 



In a worst-case or robust approach, we first find the 
point in the region of uncertainty that would, for a given 
measurement, yield the poorest outcome. We then solve 
the "original" optimization problem for this point. This 
criterion serves to assure that the probability of correct 
detection obtained using the optimal detector will not 
be lower than a certain value (the optimal value). An 
uncertainty model such as ours, that assumes very little 
prior knowledge (only the bounds Qi), can be regarded as 
possessing a worst-case quality - adding prior knowledge 
will surely improve the performance of the optimal mea- 
surement. This observation makes this specific criterion 
especially interesting. 

Using the uncertainty model ^ and problem ||SJ|, the 
worst-case measurement is the solution to 



maxmin <^ y2p^TT[U,{q,p"^ + (1 - *)/?,)] 
"^"^ U=i 



(9) 



s.t. 



Tr(pi) = 1 



(a) 
(b) 



where the constraints represent valid measurements, and 
the region of uncertainty We shall denote the optimal 
worst-case probability of correct detection as Pj^^(g). 

We begin by proving our first result, stated in Theorem 
^ We then explore in further detail the special case in 
which qi = q for all i. 

Theorem 1. Let {pi — qip^ + (1 — qOPi }"=i a set of 
quantum states, where pf and < < 1 are known and 
the states pj are unknown. Each state has prior proba- 
bility pi. Denote ri{q) ^ J2^iPiQi! and define "effective 
probabilities" 



Pi 



Denote the optimal measurement on the "effective en- 
semble" (the states {Pi}i^i with prior probabilities pi) 



n = n(p",pi) = argmax 
neB 



^p,Tr(n,pO) 
.1=1 



and the probability of correct detection achieved by 11 on 
the effective ensemble 

m 

Prf = ^j5,;Tr(II,:p°). 

4=1 

The quantum measurement n^*"" — {n)^'~"}™ that min- 
imizes the worst-case probability of a detection error is 



where IL is defined in Q . 



viQ)Pd > Pn 
viojPd < Pn 
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The worst-case probability of correct detection is 



Pn 



nwc 



Proof: The internal minimization in problem ||^ can be 
written as 



min \ ^p,teTr(n,;pO) + (1 - ft)Tr(n,pi)] 



(10) 



s.t. 



Tr(pi) = 1 



(a) 
(b) 



where the parameters {11^}™]^ are all PSD. 

Since the objective function is dependant on p\ only 
through the expression on the right, and since it is also 
separable in i (the objective is additive and the con- 
straints independent), the optimization is reduced to 
solving TO cases of the form 



min Tr(np) 

p 



(11) 



s.t. 



P > 



(a) 
(b) 



Jr(p) = 1 

By writing p as a convex combination of pure states 

n 

p^Y.aAH){H\ (12) 

the problem ({TT|l can be recast as 



(13) 




s.t. { '-^^ ' 



(a) 
(b) 



For each j, the minimal value of (/ij |n|/ij) is the minimal 
eigenvalue of H, (denoted A^'") and is achieved for a state 
l/Xj), which lies in the corresponding eigenspace. The 
optimal value of ((T^ is therefore 9j^n™ = '^n'" and 
is achieved for a state p whose range-space lies entirely 
in the associated eigenspace. 

Using this result in (|10|l . the original problem is 
equivalent to 



PT^{q) = max |XJp.feTr(n,p°) + (1 - g.)Ar]| 

(14) 

By utilizing the fact that the minimal eigenvalue of a 
Hermitian operator 11 can be written as the solution to 



f max A 
" 1 s.t. n > A/ 



(15) 



problem H14() becomes 



pWC 



iq) - max |^p,teTr(n,pO) + (1 - g,)A,]| (16) 



n, > A,/ > (a) 



s.t. 



(b) 



The objective function in problem H16|l is linear and the 
constraint are all linear matrix equalities and inequalities, 
making it a convex problem. For convex optimization 
problems which are strictly feasible (i.e. the feasibility 
set has a non-empty relative interior) , necessary and suf- 
ficient conditions for optimality are given by the Karush- 
Kuhn-Tucker (KKT) conditions 13]. In Appendix 1X1 we 
derive the KKT conditions for the specific problem H16|l : 
A measurement 11 G _B with bounds Ai > are optimal 
if there exists a Hermitian operator U satisfying 



Tr(C/) > 

([/-p,g,p°)(n,-A,/) = o 

(TiiU) - p,)K = 



(17) 



Moreover, from Lagrange Duality we know that the 
optimal values of Hi , A; and U obey the relation 



P]^C(g)=Tr(C/) 



(18) 



From (|18|l we see that the aforementioned lower bound 
Pmax on the optimal probability of correct detection is 
manifested in the requirement Tr([/) > Pmax ^ Pi- In 
(UI) we give a measurement which achieves Pd 
and shall therefore continue our analysis of the worst- 
case under the assumption Pj^'~^{q) > Pmax- 

With this extra demand in place, the last necessary 
condition in H17|l can only be satisfied if for all i, the 
eigenvalue bounds Ai equal zero. The necessary and suf- 
ficient conditions (|17|) reduce to (after ignoring the now 
redundant constraints) 



U > PiqiPi 

(U - p^q^p'i)n, = 

Denoting 77(g) = J2iPi1i^ and 

PtQi 



Pi 



(19) 



(20) 



these conditions (with (l/ry)C7 in place of U) are identi- 
cal to the necessary and sufficient conditions ©, for the 
known states p^ with prior probabilities pi. □ 
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The larger qi is, the greater the ratio "Pi/pi between 
the "effective" prior probabiHty and the real one. When 
assuming Pj^'~^{q) > Pmax, the optimal worst-case mea- 
surement n^*-" is biased towards detecting the states with 
less uncertainty. 

Intuitively, with no knowledge at all about the states 
pj, all that can be inferred about the uncertain part of 
the ensemble relies on the prior probabilities. This is 
the reason that the uncertain ensemble is (in terms of 
optimal detection) equivalent to an ensemble comprised 
of the known states with altered prior probabilities. 

In the extreme case where a state pj is completely un- 
known (qj = 0), the optimal worst-case detector ignores 
it entirely (because pj = 0) and attempts to distinguish 
optimally between the remaining m — 1 states. 

The next two corollaries give the probability of correct 
detection when the known states are mutually orthog- 
onal. 

Corollary 1. //p^ are mutually orthogonal, and for alii, 
qi > 0, then the worst-case probability of correct detection 
is 

= max{pmax,7?(g)} 

Proof: Provided that the effective prior probabilities 
satisfy Pi > 0, when p^ are mutually orthogonal they 
can always be correctly detected. Therefore Pd = 1, and 
the corollary follows immediately from Theorem ^ □ 

Corollary 2. Denote by Iq the index set of the states 
which are completely unknown, i.e. have qi = 0. Denote 
Pq = P{i € Iq) = X^ie/ Pi '^"''^ maximal prior proba- 
bility of a state from this subset pq = max{pi|i € Iq}. 

If pf are mutually orthogonal, and /q 7^ 0, then the 
worst-case probability of correct detection is 

Pf'iq) = max {p„,ax, 77(g) (1 - +Po)} 

Proof: The states p° with pi > Q are mutually orthog- 
onal, and can therefore be detected correctly. In other 
words, when i ^ Iq, the conditional probability of cor- 
rect detection is P{d\i ^ Iq) = 1. If i € /q, then the state 
must be guessed from within this subset. An optimal 
guess achieves P{d\i G Iq) = Po/Po- 

All in all, the optimal measurement on the effective 
ensemble achieves 



Pd - P{^ i h)P{d\i i lo) + P{i e Io)P{d\i e h) 
= (1 - Po) • 1 + Po^ = 1 - Po +P0 

-TO 



(21) 



The corollary follows immediately from Theorem^ □ 

Note that when there is one state with = 0, we get 
Pq = Po- The completely unknown state can be guessed 
by default. 



A. Worst-Case Detection with Uniform 
Uncertainty 

We now consider the special case in which the mixing 
bounds qi are uniform in i, i.e 



Pi = qPi + (1 - <i)p} 



(22) 



Corollary 3. Denote the optimal nominal measurement 
jjNom _ fi(^pO p.^^ 12^(1 ^fig nominal probability of correct 
detection pNom^ 

When the uncertainty is uniform, as in (|22|l . then the 
optimal worst- case measurement is 

Pmax 



n 



wc 



(g) = 



n, 



< g < 



Pn 



pNom 



pNom 
^d 

<q<l 



and the optimal probability of correct detection is 

P7C(5)^niax{p,„ax,9Pr'"} 

Proof: When the uncertainty is uniform for all i, the ef- 
fective probabilities defined in H20() are pi = Pi. The op- 
timal measurement on the effective ensemble is therefore 
the one which would have been optimal for the known 
states p^ with the original prior probabilities pi, had there 
not been any uncertainty, i.e. 11 = 11^°™. 
In addition. 



V 



(?) = ^PiQ = Q 



1=1 



The corollary then follows from Theorem ^ 



(23) 



□ 



Corollary |31 implies, that under uniform uncertainty 
with a large value of q, the best course of action in terms 
of worst-case performance is to ignore the uncertainty 
altogether. Nonetheless, although it is achieved by the 
optimal nominal measurement, the probability of correct 
detection itself is affected by the uncertainty (see Section 
EI). 

The complete symmetry in the uncertainty (total lack 
of knowledge about p} and equal mixing) does not bias 
the optimal measurement in any way, thus leaving it fixed 
with change in q, until the threshold is reached. 



IV. OPTIMAL AVERAGE DETECTION 

Optimality in the worst-case does not grant good per- 
formance throughout the region of uncertainty. Also, as 
seen in the previous section, the optimal worst-case mea- 
surement is sometimes quite pessimistic, altogether ig- 
noring the input state in favor of a guess. An alternative 
course of action is to define a distribution of probability 
over the region of uncertainty, thus enabling us to find a 
measurement, which on average maximizes Pd. 

Our model of uncertainty Q assumes a complete lack 
of knowledge about the states pj . This suggests two at- 
tributes of the probability distribution we shall define: 



6 



1 . The different unknown states are statistically inde- 
pendent of each other, 

2. For each i, p\ is distributed uniformly over the en- 
tire set of n-dimensional quantum states. 

A pure random state is equivalent to rotating an arbi- 
trary state using a random rotation — U\plq). Thus, 
the probability distribution of a "uniformly distributed" 
random pure state can be defined using the uniform mea- 
sure on the group of order n rotation operators SU (n) , 
the well known Haar measure (2^ 1291] . There does not, 
however, seem to be any natural uniform measure on the 
set of mixed states [23 (i.e. it is not simple to provide 
rational arguments for the superiority of a given mea- 
sure). Many attempts to define such a distribution rely 
on product measures js^,!^, whereby a random diagonal 
operator G, distributed using a measure on the simplex of 
eigenvalues, is rotated using a random unitary operator 
U , distributed using the Haar measure 

Prand = UGU* (24) 

Writing, the unknown states p\ in similar fashion 

p\ - U^GrU* (25) 

we shall assume that each of the operators Ui is a ran- 
dom rotation operator distributed according to the Haar 
measure. For all i, Ui are statistically independent. We 
assume nothing about the operators Gi except that they 
are valid quantum states {Gi > 0; Tr(G'i) = 1). 

Lemma 1. Let p be a quantum state that lias undergone 
a random rotation U 

p = UGU* 

wtiere U is distributed using the Haar measure. Then 
for any Hermitian operator H, the expectation value of 
Trijlp) is 

(Tr(Hp)) = iTr(H) 
n 

The proof of Lemma Q is given in Appendix IbI 
For a pure state p this result can be intuitively 
understood. The direction of the normalized vector is 
isotropically distributed, and so the average measure- 
ment result of any operator on it is the average of the 
possible outcomes (eigenvalues of the operator). Lemma 
1 states that this is also true for mixed states. 

We now wish to find the detector which maximizes the 
average probability of correct detection under the above 
probabilistic model, i.e. we wish to find 

??|| I^Pi |fjKTr[n,(g,p^ + (1 - g.)p')]| j (26) 

The main result of this section is summarized in The- 
orem |21 



Theorem 2. Given an ensemble of m quantum states 

{P^ = q^P^+i'^-q^)U^G,U*}Zl, whcrC p° and < Qi < 1 

are known, and Ui are statistically independent random 
unitary matrices distributed using the Haar measure, with 
prior probabilities pi, obtaining the quantum measure- 
ment H that minimizes the average probability of a de- 
tection error is equivalent to obtaining the optimal mea- 
surement for the ensemble {qipf -f ^—^I}iLi with prior 
probabilities pi . 

Proof: Using Lemma 1, the average probability of cor- 
rect detection is given by 



i=l 



g,Tr(H,p^) + (l-gO(Tr(H,pi)) 



g,Tr(H,p^) + ^^Tr(H, 



(27) 



5]p.Tr[n,(q,pO + i^/)] 



i=l 

Maximizing H27|l requires finding the optimal detector 
designed for states of the form pi = g^p^ 4- ^—^I with 
prior probabilities pi. Using our established notation, 
HAvg^fl(g.pO + i^/,p,). □ 

Stating Theorem|21in other words, maximizing the av- 
erage probability of correct detection involves replacing 
the uncertainty with maximally mixed states. The maxi- 
mization can done by directly solving the SDP jSjl , or by 
utilizing one of the closed form solutions, if applicable to 
the new states qip^ -\- ^-^I- 

The effective ensemble used in producing the optimal 
average measurement has the original prior probabilities. 
Therefore, the performance of the outcome of the op- 
timization is bounded below by the performance of the 
measurement in Q. This implies, that the average per- 
formance is bounded below by Pmax, but says nothing 
about the performance of this measurement in the worst 
case (see numerical results in Subsection IV A|) . 



A. Average Detection with Uniform Uncertainty 

When qi ~ q, the optimization problem that must be 
solved in order to find the optimal average detector is 

{m 1 1 

q P.Tr(H,p°) + — ^ J2 P^Tr(H,) \ (28) 
i=i 1=1 ) 

Writing the measurement operators as H^ = a^^'i, 
where Tr(\l/j) = 1, and ai = Tr(Hi), we can regard ^'^ 
as the "geometry" of the measurement operator, and ai 
as the "relative importance" of each operator. Note that 
the posteriori probability of detecting the z-th message 

m 

P(z) = J2 PjTr(H,p,) - a,Tr (vj/, p,p,) (29) 
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is proportional to Ui . 

Stating problem H28I) in these new terms, one gets 

^ — ' n ^ — ' 

1=1 i=l ) 



(30) 



> Tr(*,0 = 1 (a) 



s.t. 



(b) 



■ i=l 



The "geometry" of the measurement are determined 
only through the left-hand expression. As q grows smaller 
(more uncertainty) , more importance in determining the 
optimal measurement is given to the prior probability 
distribution, with less regard to the states p° themselves. 

When the different messages are equiprobable = ^ , 
the right hand term in becomes a constant (with 
the value ^^). The optimization problem reduces to 
finding the optimal measurement in the nominal case. 
Therefore, for all g > the optimal average measurement 
is identical to the optimal nominal measurement. Again 
we find that the optimal course of action is to simply 
ignore the uncertainty when designing a measurement. 

Numerical solutions of 10 for the states qp^ + ^-^I 
reveal that when the prior probabilities Pi are not equal, 
many ensembles exhibit a certain value of above which 
the optimal average measurement is equal to the optimal 
nominal measurement. This property, however, is not 
universal. 



NUMERICAL EXAMPLES AND 
DISCUSSION 



Table U contains a summary of the results obtained in 
the previous sections. We have defined three criteria of 
optimality - nominal, worst-case and average. Through- 
out the following, we denote the three respective optimal 
measurements as nN°'",_nWC^ and H^^s. We also re- 
fer to the measurement 11, defined in 0. The effective 
prior probabilities pi were defined in (|20|) . In this section 
we aim to demonstrate the characteristics of the different 
measurements we have defined, and the relations between 
them, via the solutions for a specific ensemble. 

The optimal measurement for each criterion was com- 
puted by explicitly solving the optimization problems (O 
and Hlt)|) for the relevant cases. The computation was 
done using the SeDuMi toolbox in Matlab. 



A. A Three-State System with Uniform 
Uncertainty 

We examine an ensemble comprised of the pure states 
p^ = \ui){ui\ in a three dimensional Hilbert space. 



1 



Ui 



V5 



1 U2 



1 

V5 



1 
72 



-l| (31) 



I Us 

with prior probabilities 

pi = 0.2 p2 = 0.3 P3 = 0.5 (32) 

under conditions of uniform uncertainty {qi = q) . 

Figure ^ shows the probabilities of correct detection 
using the different measurements as a function of q. The 
results referred to as 'BC and 'WC stand for 'best-case' 
and 'worst-case' respectively [s^- Also shown are the 
results for the nominal states, that is p\ = p^, and the 
results for pj = ij (denoted 'MM'). 

The optimal nominal measurement n^°™ is indepen- 
dent of q. Therefore the corresponding probabilities of 
correct detection, which are given by expressions of the 
form 



^pNom ^ _ g)P(n^°'", plin^""")) 



(33) 



behave linearly in q. 

As expected from Corollary O the optimal worst-case 
measurement shows two distinct regions of be- 

haviour. One can verify the result by noting that for 
q < Pmax/ P^°™ the optimal probability of correct detec- 
tion is Pd{q) = Pmax regardless of the choice of pj (in 
particular the worst and best cases are equal), whereas 
for q > Pmax/Pd°"^ the probabilities coincide with those 
obtained by 0^°™. 

The rightmost plot in Figure shows the results ob- 
tained using the optimal average measurement II^^s^ por 
high uncertainty (low q) this measurement also does not 
improve on the lower bound Pmax- Using the same ar- 
gument as above, we conclude that in this region of q 
jjAvg ^ jj jv^Q^g ^^g^^ contrary to 11^^, the lower bound 
measurement does not appear explicitly in solution to the 
problem of optimal average detection. 

For optimal average detection, the transition to 
jjAvg _ Yl occurs at a lower value of q compared to 
jjwc interpretation is that n^^s jg ^ less pessimistic 
measurement, relying on the input under conditions of 
uncertainty where already regresses to guessing. 

An important feature is that when q is high enough so 
that n^^8 be gins to be dependant on the states them- 
selves (and not only on the prior probabilities) , although 
the objective P^^^ improves monotonically, the worst- 
case probability of correct detection does not. In partic- 
ular, there is a region of q where Pj^'~^ < Pmax- 

This illustrates the fact that the results of each mea- 
surement are highly dependant on the location of pj in 
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TABLE I: Summary of the optimal measurements using different criteria 





Different mixing 


f bounds qi 


Equal 


mixing 


bounds {qi = q) 


Optimization 
criterion 


Measurement 


Optimal Pd 


Measurement 


Optimal Pd 


max Pd in 
nominal case 


jjNom A fl( X 


m 

pNom A mTrin^"'^ 0°) 

d / J Hi ) 


SIMILAR 


max Pd in the 
worst case 


or 

n 


m 

/ — *i X ~ m / I I () \ 

2^ P«Tr(n,p, ) 
1=1 


or 


j-jNom 

n 


qPd°" 

pmax 


msLxPd on 








jjAvg 




average 






if "ii Vi — — 


-j-j-Nom 





"One must choose between the two measurements, according to 
Pd achieved by each. 




FIG. 1: Probabilities of correct detection using the different optimal measurements as a function of q, for the ensemble defined in 
I31I I: (a) optimal nominal measurement; (b) optimal worst-case measurement; (c) optimal average measurement. The different 
lines in each graph are different manifestations of the unknown states p}, where the abbreviations are defined in the text. 



the region of uncertainty. The optimal average solution 
may have a very bad worst-case, and the optimal worst- 
case solution may lead to poor detection on average. In 
general, the designer of a specific setup must make similar 
calculations and use cost-benefit considerations in order 
to choose between the available 'optimal' options. Hence, 
the choice of measurement that will be eventually used 
is dependant on the specific application at hand (the in- 
tended use of the apparatus, the known states and 
probabilities pi, and the mixing bounds qi). 

Further insight can be revealed by examining 
the distances between measurements. Denoting 
n = [ill ■ ■ • Hm] (the matrix whose columns 

are the measurement operators), we use the Frobenius 



norm to define the measurement distance 

L'(n,1') = ||n- ^-IIf (34) 

and the measurement difference 

m{q)^\\\l{q)-\l{q-5q)\\F (35) 

The measurement difference can be thought of as a gra- 
dient of the distance from a reference measurement. 

The left side of Figure H shows 15(0^0^ n^om) 
^jjWCj-^-j fpj^g g|-gp gj^g used in the calculations is 
5q — 0.005. Because n^°™ is not a function of q, any 
change in the distance is due to change in . One can 
clearly see the discontinuous change between the region 
q < Pmax/ Pd°™ , where = n, and the region of low 
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0.2 OA 0.6 0.8 



(a) 



0.2 0.4 0.6 o.a 



(b) 



0.2 0.4 o.e 0.8 



(c) 



0.2 0.4 0.6 O.a 



(d) 



FIG. 2: Measurement distances and differences as a function 
of for the ensemble defined in l|31jl : (a) distance between 



and n""""; (b) change in H^^; (c) distance between 
n*"*^ and n''°'"; (d) change in Tl^' 



APPENDIX A: KKT CONDITIONS FOR THE 
WORST-CASE PROBLEM 

The Lagrangian of problem Hlt)|) is 
L = - ^KTr [{qU, + (1 - q)A,/) - ^ wA^ 



J2 Tr [Z,{n, - XJ)] + Tr U{I - 



4=1 



4=1 



(Al) 



where > 0, > and [/ are the Lagrange multipliers. 

The KKT necessary and sufficient conditions for opti- 
mality are given by 



En. 



4=1 



Z,. > 



U, > X,I > 



w, > 



dL 

— = -p,q,p"^ -Z, + U = 

an, 

dL 

— = -Pi{l - Qi) + Tr(Zi) -Wi =0 

z,(A,/-no = 



(A2) 

(A3) 

(A4) 
(A5) 

(A6) 



uncertainty where n = jjNom^ rpj^^, rightmost plots 
show D(n^^g,nN°'°) and (5n^^s(g). One can see that 
n^'^s exhibits both continuous and discontinuous change. 
Moreover, in this example, for high values of q, we find 
that i:)(n^'^s,nN°m) = O. This is characteristic of many 
ensembles, although as stated above, is not universal. 



VI. CONCLUSION 



W^X^ = (A7) 

From (|A4|) and ljA5|l we find that at the optimum 

Z, = U- p.,q,p1 (A8) 
iLH = Tr(C/) - (A9) 

Using these relations, the KKT conditions can be recast 
in the form (|17|l . 



We considered the discrimination of quantum states, 
drawn from a finite set with known prior probabilities, 
where the states themselves are not entirely known. We 
derived two sets of necessary and sufficient conditions for 
the optimality of a quantum measurement in discriminat- 
ing between the states. The first in the sense of minimal 
worst-case probability of a detection error, the second 
in the sense of minimal average probability of detection 
error. In both cases, the uncertainty is manifested as 
optimally discriminating among an "effective ensemble" . 
We found that under our model, when the uncertainty 
is of uniform magnitude for all states, one can, in many 
cases, ignore it altogether. 

Possible avenues for further work are looking into 
structured uncertainty - where the unknown states p] 
are known to some extent (for example their range-space 
is restricted to a certain subspace), and leakage between 
the possible states, i.e. the unknown states are a mixture 
of the known states p\ = hjp^. 



APPENDIX B: PROOF OF LEMMA 1 

Given a random pure state with uniform distribu- 
tion over the unit sphere, and an arbitrary orthogonal 
basis {|7rfe)}5!^j^, the n probabilities 

ak = \{7rk\t^)\^ (Bl) 

form a random vector in the n—l dimensional simplex B, 
defined by the conditions ct/c > for all k and ak = 1- 
The distribution of a is given by (see psHs^ ) 

= (B2) 

where {yk}k=i independent random variables obeying 
an exponential distribution with parameter 1. 

Due to the fact that B, the domain in which a is dis- 
tributed, is a convex set, the expectation value of a must 
lie in B, i.e. (<?) S B. The distribution of a is, of course. 
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symmetrical with respect to exchange of any of its co- 
ordinates ((Tfc <-*■ (Tfc/), and then so must be the average. 
The only such symmetrical point in B is ( ^ , . . . , ^ ) . And 
so 



n 



(B3) 



Given a quantum state that has undergone a random 
rotation U 



p = UGU* 



(B4) 



we can assume without loss of generality that the state 
G is diagonal, thereby permitting us to rewrite p in the 
form 



(B5) 



with 5j ^ 1 ^11"^ il^jll^j') ^ ^jd'- We now express 

using their harmonic expansions in the eigenvectors 

of n. 



fc=l 



(B6) 



where Itt^) is the eigenvector of 11 corresponding to eigen- 
value Afe. Substituting (jBGp in (|B5|) we get 



(B7) 



n n n 

p = X! 5j X! "jfe Kfe) X! "i*:' 

j=l k,k' = l 



and 



n n 



Tr(np)-^ J2 9ja,ka*,,TT{n\TTk){nk'\) 

11 n 

j=l k,k'=l ^"7"' ' 



(B8) 



3 = 1 k=l 

Although l/ij) are not probabilistically independent 
(they are mutually orthogonal), their marginal distri- 
butions are all uniform. The expectation values of the 
squared modulus are (from IjBSp l 



(B9) 



which in turn leads to 



(Tr(np))=5:^ 



i=l k=l 

n 11 



-E5.E^fc--Tr(n) 



(BIO) 



3 = 1 k=l 
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